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Abstract 

For two or more classes (or types) of points, nearest neighbor contingency tables (NNCTs) are con- 
structed using nearest neighbor (NN) frequencies and are used in testing spatial segregation of the classes. 
Pielou's test of independence, Dixon's cell-specific, class-specific, and overall tests are the tests based on 
NNCTs (i.e., they are NNCT-tests ). These tests are designed and intended for use under the null pattern 
of random labeling (RL) of completely mapped data. However, it has been shown that Pielou's test is 
not appropriate for testing segregation against the RL pattern while Dixon's tests are. In this article, 
we compare Pielou's and Dixon's NNCT-tests; introduce the one-sided versions of Pielou's test; extend 
the use of NNCT-tests for testing complete spatial randomness (CSR) of points from two or more classes 
(which is called CSR independence, henceforth). We assess the finite sample performance of the tests 
by an extensive Monte Carlo simulation study and demonstrate that Dixon's tests are also appropriate 
for testing CSR independence; but Pielou's test and the corresponding one-sided versions are liberal for 
testing CSR independence or RL. Furthermore, we show that Pielou's tests are only appropriate when 
the NNCT is based on a random sample of (base, NN) pairs. We also prove the consistency of the tests 
under their appropriate null hypotheses. Moreover, we investigate the edge (or boundary) effects on 
the NNCT-tests and compare the buffer zone and toroidal edge correction methods for these tests. We 
illustrate the tests on a real life and an artificial data set. 
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spatial point pattern 
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1 Introduction 



The analysis of spatial point patterns in natural populations has been extensively studied in various fields. 
In particular, spatial patterns in epidemiology, population biology, and ecology have important implications. 
A spatial point pattern includes the locations of some measurements, such as the coordinates of trees in a 
region of interest. These locations are referred to as events by some authors, in order to distinguish them 
from arbitrary points in the region of interest (jPiggld (|2003[) ). However in this article such a distinction is 
not necessary, as we only consider the locations of events. Hence points will refer to the locations of events, 
henceforth. Most point patterns also include other types of measurements for each point, such as categorical 
label (e.g., species label) or size (e.g., height of pine saplings). Such labelled data are marked point patterns 
generated by marked point processes, which define the distributions of the "marks" or "labels" to the locations 
of the points and p erhaps are th e most common spatial point patterns. For a general discu ssion of marked 
point processes, see Diggld ( 2003 ). Gavrikov and Stovan (1995), Penttinen et al. ( 1992f) . and Schlather et al. 
(200J). For convenience and generality, we call the different types of points as "classes". From the early days 



on, the related research has mostly been on only one class at a time; i.e., on spatial pattern of each class (e.g., 
density, dumpiness , etc.). These patter ns in a one-class framework fall under the pattern category called 
spatial aggregation ( Coomes et al.l ( 19991 )) or clustering. H owever, it is a lso of practical interest to investigate 
the patterns of one class with respect to the other classes (Pielou ( 196l[ )'). The spatial relationships b etween 
two o r more c lasses have impor t ant co nsequences especially for plant species. See, for example, iPielou 
( 196ll) . lDixo"3 (|l994l ). and|Di2con| (|2002al) for more detail. Although we refer to types of points as "classes" , 
the "class" can be replaced by any characteristic of an observation a t a par t icular location. For example, 
the pattern of spatial seg,r egation has been investigated f or species (IPiggld (I2003I) '). age classes of plants 
( Hamill and Wrighd (|l986l) ) and sexes of dioecious plants (jNanami et al.l (|l999l )V We also note that many 
of the epidemiological applications are for a two-class system of case/control labels (jWaller and Gotwav 
( 200l l 



In various fields, there are many tests available for spatial point patterns. An extensive survey is provided 
by Kulldorff who enumerates mor e than 100 such tests, most of which need adjustment for some sort 
of inhomog eneity (iKuUdorfj (120061) ). He also provides a general framewor k to classify these tests. The 
most widely used tests include Pielou's test of segregation for tw o classe s (IPieloul ( 196ll) ) due to its ease 
of computation and interpretation and Ripley's K or L- functions ( Riplevl (|2004l )). The abundance of tests 



results because (i) the tests for which Monte Carlo critical values are the only criteria receive wide acceptance 
in various fields; (ii) there are many different types of segregation patterns and some tests are designed to 
detect only certain types of segregation patterns; and (iii) the lack of cross-fertilization between different 
scientific fields so that new tests are proposed unbeknownst to the developers of similar tests. 



Nea rest neighbor (NN) methods for spatial patterns include at least six different groups (see, e.g.. lDixon 
(l2002bh '). The methods utilize some measure of (dis) similarity between a point and its NN; such as the 
distance between the points or the class types of the points. The latter type of similarity is used in the 
NN methods in this article. Nearest neighbor contingency t ables (NNCTs ) are constructed using the NN 
frequencies of classes and are used in testing spatial patterns. Pieloul (|l96lh proposed tests (for segregation, 
symmetry, niche specificity, etc.) based on NNCTs under the RL of locations in the study re gion an d Dixo n 
devised cell-specific, class-specific, and overall tests bas ed on NNCTs for the two-class case (jPixon (119941) ') 
and extended his methodology to the multi-class case l|Dixonl (|2002a[ l) under RL. Pielou's tests have been 
used for the two-class case only. However it has been demonstrated tha t Pielo u's test is not appropriate for 
the NNCTs constructed under the RL of points ( Meagher and Burdickl ( 1980l )). 



In this article, we discuss the tests of spatial segregation based on NNCTs. We describe the assumptions 
and hypotheses, the tests, and the underlying sampling frameworks for Pielou's and Dixon's tests. We 
propose one-sided versions of Pielou's test to detect the direction of deviation from the RL pattern; then 
extend the use of Pielou's and Dixon's tests for the CSR of points from two or more classes in the region of 
interest. However, we demonstrate that under CSR independence, Dixon's tests are conditional tests, and 
propose a method to remove this conditional nature of Dixon's test. We also compare the empirical sizes of 
the NNCT-tests with an extensive Monte Carlo simulation study, where we demonstrate that Pielou's test 
and the corresponding one-sided versions are liberal for rejecting RL or CSR independence, while Dixon's 
tests are about the desired nominal level. We also prove the consistency of the tests under their appropriate 



null hypotheses; show that Pielou's test is only appropriate when the NNCT is based on a random sample of 
(base, NN) pairs (which is not realistic in practical situations). We also investigate the edge (or boundary) 
effects on the NNCT-tests under CSR independence only (since edge effects is not a concern under RL). 

We describe the spatial point patterns of RL and CSR independence in Section [21 describe the NNCT- 
tests in Section 131 in particular we describe the construction of the NNCTs in Section [XTl Pielou's test in 
Section 13.21 Dixon's NNCT-tests in Section 13.31 extend Dixon's test for the CSR independence pattern in 
Section [3^ We prove the consistency of the NNCT-tests in Section |4] (and defer the proofs to the Appendix 
Section); present our extensive Monte Carlo simulation analysis in Section O in particular we compare the 
empirical significance levels of the tests under RL in Section 15.11 under CSR independence in Section 15.21 
under the independence of rows and cell counts in NNCTs in Section [5T51 We also consider the edge correction 
methods under the CSR independence pattern in Section [SI illustrate our methods on two example data sets 
in Section [71 We provide our discussions and conclusions as well as guidelines for using the tests in Section 

m 



2 Spatial Point Patterns 



For simplicity, we describe the spatial point patterns for two-class populations; the extension to the multi- 
class case is straightforward. 

In the univariate (i.e., one-clas s ) spa tial point pattern analysis, the null hypothesis is usually complete 
spatial randomness (CSR) ( Diggle ( 2003h ). Given a spatial point pattern V = {Xi ■ I{Xi e D), i — 1, . . . ,n : 



D C K^} where Xi stands for the location of event i (i.e., point i) and l{Xi € D) is the indicator function 
which denotes the Bernoulli random variable denoting the event that point i is in region D. The pattern 
V exhibits CSR if given n events (i.e., locations of the points) in domain D, the events are an independent 
random sample from the uniform distribution on D. Note that this condition also implies that there is 
no spatial interaction; i.e., the locations of these points have no influence on one another. Furthermore, 
when the reference region D is large, the number of points in any planar region with area A{D) follows 
(approximately) a Poisson distribution with intensity (i.e., number of points per unit area) denoted by A 
and mean A • A{D). 

To investigate the spatial interaction between two or more classes in a bivariate process, usually there 
are two benchmark hypotheses: (i) independence, which implies two classes of points are generated by a pair 
of independent univariate processes and (ii) random labeling (RL) , which implies that the class labels are 
randomly assigned to a given set of locations in the region of interest (jDiggle (|2003M In this article, we will 



consider two random pattern types as our null hypotheses: CSR of points from two classes (this pattern will 
be called CSR independence, henceforth) or RL. 

In CSR independence, points from each of the two classes satisfy the CSR pattern in the region of interest. 
On the other hand, RL is the pattern in which, given a fixed set of locations in a region, class labels are 
assigned to these fixed locations randomly so that the labels are independent of the locations. So, RL is less 
restrictive than CSR independence, in the sense that RL does not impose any restrictions on the distribution 
of the locations of the events, but CSR independence is a process defining the spatial distribution of event 
locations. The RL or CSR independence patterns imply a more refined null hypothesis for the NNCT- 
tests, namely, Ho : randomness in the NN structure. When the points from each class are assumed to be 
uniformly distributed over the region of interest, then randomness in the NN structure is implied by the 
CSR ind ependence pattern, which is also referred to as (a type of) "population independence" by some 
authors ( Goreaud and Pelissiei (|2003h ). Note that this type of CSR is equivalent to the case where the RL 



procedure is applied to a given set of points from a CSR pattern in the sense that after points are generated 
uniformly in the region, the class labels are assigned randomly. When only the labeling of a set of fixed 
points (the allocation of the points could be regular, aggregated, or clustered, or of lattice type) is random, 
the randomness in the NN structure is implied by RL pattern. 

The distinction between the RL and CSR independence is ver y important when defin i ng the appropriate 
null model which depends on the particular ecological context. iGoreaud and Pelissier (l2003l) discuss the 



differences between independence and RL patterns and show that the incorrect specification of the nuU 
pattern may result in incorrect results, e.g., for Ripley's K or L-functions. They also propose some guidelines 
to determine which n ull hypothesis is appropriate fo r a given situation. For the null case of CSR independence 
(just independence in lGoreaud and Pelissied ( "2003| )) the locations of the points from two classes are a priori 



the result of (perhaps) different processes (e.g., individuals of different species or age cohorts), whereas for 
the null case of RL some processes affect a posteriori the individuals of a single population (e.g., diseased 
versus non-diseased individuals of a single species). Notice that although CSR independence and RL are 
not same, they lead to the same null model (i.e., randomness in NN structure) for tests using NNCT, which 
does not require spatially-explicit information. 

Deviations from the null patterns (RL or CSR independence) were first called (positive or negative) 
segregation. In Pielou's approach, two classes can be described as "unsegregated" if the NN of an individual 
is as likely to be of the same class as the other class; that is, neither class has a tendency to occur in one-class 
clumps or clusters. Negative segregation occurs if the NN of a point is more likely to be from a different class 
than the class of the point. Positive segregation occurs if the NN of a point is more likely to be of the same 
class a s the c lass of the point; i.e., the members of the same class tend to be clumped or clustered (see, e.g., 
Pieloul (UmiI)). The concept of "negative segregation" as described above, is more com monly referred to as 



association, whereas "positive segregation" is merely called segregation. See, for example, Cressid ( 1993|) and 



Coomes et al. for more detail. Two classes may exhibit many different forms of segregation (jPieloul 



(Il96lh ). Although it is not possible to list all segregation types, existence of segregation can be tested by 



using NNCTs. In the statistical literature, association in contingency tables generally refers to categorical 
association. To avoid confusion between this general association and the spatial pattern of association, we 
call the former as "categorical association" and the latter as "spatial association" . No such confusion occurs 
for segregation. 



3 Tests Based on Nearest Neighbor Contingency Tables 

In this section, we present the construction of NNCTs and then Pielou's and Dixon's tests based on NNCTs. 



3.1 Construction of Nearest Neighbor Contingency Tables 

Consider two classes labeled as {1,2}. NNCTs are constructed using NN frequencies for each class. Let Ui 
be the number of points from class i for i S {1,2} and n = ni -\- n2. If we record the class of each point 
and its NN, the NN relationships fall into 4 categories: (1,1), (1,2); (2,1), (2,2), where in cell {i,j), class 
i is the base class, while class j is the class of its NN. That is, a (base, NN) pair is categorized according 
to its label. Denoting Nij as the observed frequency of cell {i,j) for i,j £ {1,2}, we obtain the NNCT in 
Table [1] where Cj is the sum of column j; i.e., number of times class j points serve as nearest neighbors for 
j € {1, 2}. Note also that n = j ^ij^ = T^j=i ^'^'^ ~ We adopt the convention that 

capital letters stand for random quantities, while lower case letters stand for fixed quantities. 

Let TTij = P{JJ = i,V = j) be the probability that the pair of points {U,V) falls in cell (i, j); i.e., the 
point V is from class j and is a NN of the point U which is from class i. Furthermore, let i^i = tth + ■ni2 for 
i = 1, 2, that is, the probability of a base point to be of class i. Similarly, let Kj = -Kij +1^2] for j = 1, 2, that 
is, the probability of a NN point to be of class j. The sample versions of these probabilities are TTy = Nij/n 
for TTy, Vi = Ui/n for Vi, and Kj = Cj/n for K,j. 

A (base, NN) pair can be categorized as reflexive or non-reflexive, regardless of the classes of the members 
of the pair. For a (base, NN) pair, {X,Y), (i.e., F is a NN of X), if X is a NN of Y (i.e., {Y,X) is also a 
(base, NN) pair), then the pair {X,Y) is called reflexive. If a (base, NN) pair is not reflexive, then it is a 
non-reflexive pair. Moreover, a point can serve as NN to none or s ever al other points. That is, a point can 
be a shared NN to k {k < 6) other points in M? (| Clark and EvansI (fl955i )). 



3.2 Pielou's Test of Segregation 



Pielou constructed NNCTs based on NN frequencies which yield tests that are independent of quadrat size 
( Pieloul (Eieih . lKrebsl (Il972l )). : n the two-class case, Pi elou us e d the usual Pearson's test of independence 
(with 1 df) to test for presence or lack of segregation ( Pieloul ( 196ll)). Due to the ease in co mputation and 
interpretation, her test of segregation is widely used in ecology ( Meagher and Burdick ( 1980l )') for both com- 
pletely mapped or sparsely sampled data. In particular, Pielou has described and used her test of segregation 



for co mplet ely mapped d ata, although her test is not appropriate for such data (see iMeagher and Burdick 
(Il98d) and iDixonI ( Il994l) 'l. A data set is completely mapped, if the points (i.e., locations of all events) in 
a defined space are observed. Alternatively, sparse sampling might be suitable for the use of Pielou's test 
as suggested by iDixoiil ( 1994 ). Although sparse sampling is not clearly defined in the literature, it can be 
classified into two types. The sparse sampling schemes depend on the events (members of a class occupying 
a location) or arbitrary points in the region of interest. The most well known sparse sampling method is the 
quadrat sampling, in which the number of events falling into each of several (preferably random) small sub- 
regions (quadrats) is recorded. However construction of NNCTs may not even be possible in such a scheme, 
since the NN information may be lost when only the number of events are recorded for each quadrat. The 
second sparse sampling scheme is the distance sampling, in which the basic unit is an arbitrary point (not 
necessarily fr om the events) and the information based on the distance to the nearest event is recorded 
(Solow 1 19891) 1 Notice that this type of distance sampling scheme does not yield sufiicient information for 
the construction of NNCTs. 



Pearson's test of independence for a 2 x 2 contingency table, in general, can be assumed to develop from 
one of the following frameworks: Poisson, row-wise binomial, or overall multinomial sampling frameworks. 
Below we briefly describe these frameworks for 2 x 2 contingency tables. Let 7fy be the probability of a 
point to fall in cell {i,j) in the contingency table, and ui and Hj be the probabilities that the point is of row 
category i and of column category j, respectively. 



The test statistic for Pielou's test (which is same as Pearson's test) is given by 

,--E 
E[iV, 



i=l 3 = 1 



Poisson Sampling Pramev^rork: Each category count in the contingency table is assumed to be an inde- 
pendent Poisson variate. Another key feature is the independence of cell counts, which would imply that 
T^ij = Vi Kj for all (i, j) G {1, 2}. This independence can be tested by Pearson's test for large samples and 
Fisher's exact test or the exact version of the Pearson's test for small samples ( Agresti ( 19921 )). The null 



hypothesis in this framework is Ho : T^ij = Vi Hj, and E[Nij] of Equation ([T]) is 

Ha 
Ha 



Among the alternatives. 



TTii > Vi Hi is suggestive of positive categorical association for classes i = 1,2, while Ha 
7^21 < 1^2 is suggestive of negative categorical association between classes 1 and 2. 



7ri2 < 1^1 K2 or 



However, in the case of NNCTs, cell counts, e.g., A^i2 and N21 arc not independent under RL or CSR 
independence. Because, a (base, NN) pair is more likely to be a reflexive pair, rather than a non-reflexive pair 
under RL or CSR independence ( Meagher and Burdick ( 1980l )). Thus under Poisson sampling framework, 
Pielou's test would be inappropriate for testing RL or CSR independence. 



Row-wise Binomial Sampling Framework: In this framework, we assume that Ni ~ ni arc given and 
Nij ^ BIN(ni, TTy), the binomial distribution with trials and probability of success being 7?^ . Notice that 
for more than two classes, this will be row-wise multinomial framework. 

Then the null hypothesis for this test is Hg : tth = 1^21 which also implies 7ri2 — t^22- The alternative 
Ha '. TTii > TT21 would corrcspond to positive categorical association which would also imply 1122 > 7^12 • 
Similarly, the alternative Ha : tth < n2i would correspond to negative categorical association which would 
also imply 7122 < t^i2- 

Under Ho, we can parametrize the null model as Ntj ~ BIN(ni,Kj) where kj can be estimated as 
Cj/n and is assumed to equal the expectation E[Cj/n]. Then Ho : ttii = 7r2i = ki is equivalent to 



Ho : E[A^ii/ni] = E[A''2i/n2] = ki which is equivalent to Ho : ElNu] = rii ki and E[Af2i] — 1^2 ki which, for 
large n, ni, and n2, is equivalent to Ho : E[7Vii/n] = ni Hi/n = vi ki and E[Af2i/"-] = Ki/n = ?2 '52- 

Under Ho, if j are known, Xp is approximately distributed as X2 (i-^-j distribution with 2 degrees 
of freedom) for large nf, if are not known, but estimated as Cj/n, then Xp is approximately distributed 
as Xi for large n^. In most practical situations, the latter case will occur, so Xi distribution is used for this 
test. 

In the two-class case, (iVii, A^i2) and (A^2i, ^22) are assumed to be independent and so are the individual 
trials, namely, (base, NN) pairs. Under RL or CSR independence, this assumption is invalid for completely 
mapped data. Because the trials that constitute Nu for i = 1,2 are not independent due to reflexivity and 
shared NN structure; likewise, A'^12 and N21 are not independent. Hence, Pielou's test is not appropriate for 
RL or CSR independence in this framework cither. 

Overall Multinomial Sampling Framework: An alternative sampling framework for contingency tables, 
in general, is that the cell counts arc assumed to be from independent multinomial trials. That is, for the 
two-class case, 

N = (A^ll,7Vl2,iV2i,7V22) ^ ^(ri,7fll,Tri2,Tf21,7r22) 

hence the name overall multinomial framework. The null hypothesis in this framework is Ho '■ (7rii,7ri2) = 
(7r2i,Tr22) and E[A'^ij] in Equation ^ is NiCj/n. The multinomial counts are not independent (since they 
are negatively correlated) when conditioned on their total. This dependence alleviates as the sample size 
increases, but might confound the small sample results. In addition to this mild dependence, the NNCT 
cell counts are not independent due to, e.g., reflexivity. Hence the overall multinomial framework is not 
appropriate for NNCTs based on RL or CSR independence either. 

Note that conditional on Ni ~ Ui, the overall multinomial framework reduces to the row- wise multinomial 
framework. Furthermore, when the parameters are not known but estimated from the marginal sums, all 
frameworks yield tests that are approximately distributed as x\ for large n. 



3.2.1 One-Sided Versions of Pielou's Test of Segregation 



Pielou's test is a general two-sided test, hence it does not indicate the direction of the deviation (e.g., 
positive or negative categorical association) from the null case. To determine the direction, one needs to 
check the NNCT. Since '"^^rL"'' for large n, we can write X^, = Zl where Z„ "^-""^ iV(0, 1), the 
standard normal distribution. By some algebraic manipulations, among other possibilities, for the row-wise 
multinomial framework, Z„ can be written as 



Z,. . ( ^ _ ^ (2) 
n-i n2 J V Ci 62 



See ( Bickel and Doksum (1973)) for the sketch of the derivation. Positive values of Z„ indicate positive 



categorical association, while negative values indicate negative categorical association. When cell counts are 
independent, a reasonable a-level test is rejecting Ho if Z„ > for segregation or if Z„ < for spatial 
association. The a-level x^ test in which we reject for Xp > xj(l — a) is equivalent to the two-sided a-level 
test based on Zji. 

The corresponding test statistic for the overall multinomial framework can be written as 

~Z. = (^n - ^) (3) 
V n J V ni n2 Ci C2 



Once again, we point out that these one-sided tests are not appropriate for testing RL or CSR indepen- 
dence, due to inherent dependence of cell counts in NNCTs based on such patterns. 

Remark 3.1. Appropriate Null Case for Pielou's Tests: In Pielou's test, each of the Poisson and row- 
wise binomial sampling frameworks for cell counts assumes that the trials (i.e., the cross-categorization of 



base-NN pairs) are independent and in the overall multinomial framework, there is mild dependence between 
the cell counts. The independence of rows and individual trials (i.e., cells) would follow if NNCT were based 
on a random sample of (base label, NN label). But unfortunately, this usually is not realistic in practice, 
although it might have theoretical appeal. When we have a random sample of (base label, NN label) pairs 
(which are also called the (base, NN) pairs), the null hypothesis for Pielou's test is equivalent to the case 
that the vector of probabilities for the cell frequencies for each row are identical. Hence if the NNCT is based 
on a random sample of (base, NN) pairs, then any of the sampling frameworks would be appropriate which 
in turn implies the appropriateness of Pearson's test of independence for the NNCT. The null hypothesis in 
each of the sampling frameworks will imply independence between the patterns of the two classes. On the 
other hand the alternatives of positive categorical association will correspond to segregation of the classes, 
while negative categorical association will correspond to spatial association of the classes. □ 



3.3 Dixon's NNCT-Tests 



Dixon proposed a series of tests for s egregat i on ba sed on NNCTs, namely, cell- and class-specific tests, and 
overall test of segregation under RL ( DixonI ( 19941 )). 



3.3.1 Dixon's Cell-Specific Tests 



The level of segregation is estimated by comparing the observed NNCT cell counts to the expected NNCT cell 
counts under RL of fixed point s. Dixon demo nstrates that under RL, one can write down the cell frequencies 
as Moran join-count statistics (iMoranI (Il948l) '). He then derive s the means, variances, and covariances of the 
cell counts (i.e., frequencies) (jDixonlj 19941 ) and Dixon ( 2002a[) ). 



When the null hypothesis is RL, we have 



or equivalently 



n[n — 1) 



ni{ni ~ l)/{n - 
rii rij / {n — 1) 



1) ifi=j, 
if « 7^ j, 



(4) 



n{n — 1) 



The test statistic suggested by Dixon is given by 



where 



Var[iVi., 



(n -I- R)pii + (2n -~2R + Q)piii -f (n^ — 3n - Q + R)piiii - (npu)^ if i = j, 



(5) 



(6) 



^npij + Qpiij + (n^ - 3n - Q + R)piijj — (npij)'^ if i / j, 

with pxx, Pxxx, and Pxxxx are the probabilities that a randomly picked pair, triplet, or quartet of points 
respectively, are the indicated classes and are given by 



Pi'. 



Piijj 



Hj [rii - 1) 

n[n — \) ' 

rij {rii ~ 1) {rii ~ 2) 
n{n-l){n-2) ' 
rii {rii - 1) rij {rij - 1) 

n{n~ l)(n-2) (n - 3) ' 



n(n — \) 
rii {n-i — ^) rij 
" n(n-l)(7i-2)' 
_ rij {rii - 1) {rii - 2) {n^ - 3) 
~ n(n- l)(n-2)(n-3) ' 



(7) 



Furthermore, Q is the number of points with shared NNs, which occurs when two or more points share a 
NN and R is twice the number of reflexive pairs. Then Q = 2 {Q2 -I- 3 Qs -f 6 (^4 + 10 Q5 -I- 15 Qe) where Qk 
is the number of points that serve as a NN to other points k times. 



One-sided and two-sided te sts are possib le for each cell using the asymptotic normal approximation 
of Z[j given in Equation ([5]) (jPixonl (|l994l )). In Dixon's framework, Nij are random quantities; and the 
quantities in the expectations, hypotheses, and variances are conditional on A'^ = n and Ni ~ rii for i G {1, 2}. 
The column sums are irrelevant for Dixon's tests. 



We describe the setting in a broader context. Let Vi be the probability of an arbitrary point being from 
class i. Then under RL, Tr^ = Vi Vj and the expression I(t = j) -I- I(i 7^ j) can be viewed as 

an estimate of tt^ and denoted as iiij. Furthermore, given large N = n, under the null hypothesis of RL the 
expected values given in Equation (|4|) implies 

Ho : TTy = i^, i^j 

and the test statistic {Nij/n — j ) / V Var [Nij ] is approximately equivalent to in Equation ([S]). In 
Dixon's framework, for large n and n^, the row marginals satisfy E[Ni/n\ = Vi and the column marginals 
satisfy E[Cj/n] = kj = X]'i=i '^i ~ ^j- 



3.3.2 Dixon's Overall Test of Segregation 



Dixon's overall test of segregation tests the hypothesis that expected values of the cell counts in the NNCT are 
equal to the ones given in Equation ([4]). In the two-class case, he calculates Zu = [Nu — E[A'^ii])/ ^JWarlNa] 
for bo th i € |1, 2| a nd then combines these test statistics into a statistic that is asymptotically distributed 
as ( DixonI ( 1994 ) V The suggested test statistic is given by 



N22 - E W22 



Var[7Vn] 

Cov[iVil,7V22] 



Cov[7Vil,7V22] 

VarW22] 



Nil - E[iVn] 

N22 - E[7V22] 



(8) 



where E[A'ii] are as in Equation ([4]), Va.r[Nii] are as in Equation ([6]), and 

Cov[iVii, N22] = {n^ -in-Q + R) P1122 - n"^ PiiP22- 

Under Hq : 'E[Nii/n] = vf for i = 1,2, and E[A'|,] = 2 and Var[A'|,] = 4; i.e., the non-centrality parameter 
A = 0. If we parametrize the segregation alternative as : E[A^ii/n] = {fi + ei)^ and E[A^22/'^] = 
{I — ui + £2)^ for some £1, £2 > 0. Then under , the non-centrality parameter satisfies A = A(£i, £2) > 
since A(£i,£2) = Es[Y]'I]g^Es[Y] where 



1 



Es[Y]'-[(i.l+£l),(l-Z.l+£2)] 



and Ss is the (positive definite) variance-covariance matrix of the cell counts under . If the association 
alternative is parametrized as above with £i,£2 < then we obtain the same non-centrality parameter 

A(£i,£2). 



iDixon (2002a) extends his test for multi-class case (i.e., for the case with three or more classes). He 
also partitions the overall test statistic X^^ into class-specific test statistics each of which are dependent but 
approximately follow a distribution. 



3.4 Dixon's Tests under CSR Independence 



The exp ected values o f the NNCT cell counts given in Equation Q are derived under RL or CSR indepen- 
dence bv iDixon (1994). However, the var iances a nd co varia nces of t he cell counts used in Sections 13.3. II and 
13.3.21 are derived under RL pattern only (jDixonl (jl994I ) and iDixonI (j2002al )). 



When the null hypothesis is CSR independence, the expressions for the variances and covariances of the 
NNCT cell counts are as in RL case, except they are conditional on Q and R. The quantities Q and R are 
fixed under RL, but random under CSR independence. Hence under CSR independence Dixon's cell-specific 
test given in Equation ([5]) asymptotically has N{0, 1) distribution and overall test given in Equation ([8|) 



asymptotically has X2^ conditional on Q and R. Under the CSR independence pattern, the unconditional 
variances and covariances (hence the unconditional asymptotic distributions) can be obtained by replacing 
Q and R with their expectations. 

Unfortunately, given the difficulty of calculating the expectations of Q and R under CSR independence, it 
seems reasonable and convenient to use test statistics employing the unconditional variances and covariances 
even when assessing their behavior under the CSR independence pattern. Alternatively, one can estimate 
the values of Q and R empirically, and substitute these estimates in the variance and covariance expressions. 
For example, for homogeneous planar Poisson pattern, we have 'E[Q/n] « .632786 and E[i?/n] « 0.621120 
(estimated empirically by 1000000 Monte Carlo simulations for various values of n = ni + 712 on the unit 
square) . 



To assess the influence of conditioning on the performance of Dixon's tests for the two-class case, we 
consider both the conditional version of these tests, as well as the unconditional version, in which the terms 
Q and R are replaced by 0.63 ri and 0.62 n, respectively. We call the latter type of correction as QR- 
adjustment and the transformed tests as QR-adjusted tests, henceforth. QR-adjusted version of Dixon's 
cell-speciflc test statistic for cell (i,j) is denoted by Z^'"^^ and of the overall test statistic is denoted by 

Remark 3.2. Extension of NNCT- Tests to Mu l ti-Clas s Case: Dixon has extended his tests into multi- 
class situation with three or more classes (|Dixonl (j2002a[) ). For q classes with q > 2, the corresponding 
NNCT is of dimension q x q. It is possible to define q'^ cell-specific tests as in Equation ([S]), and one can 
combine the tests into one overall test similar to the one given in Equation ([S]), which will have Xg(g_i): 
asymptotically. On the other hand, Piclou's test is defined and has only been used for the two-class spatial 
patterns. Its inappropriateness discourages its immediate extension to multi-class patterns. □ 



3.5 Comparison of Pielou's and Dixon's Tests 

Dixon points out two problems with Pielou's test of independence: (i) it fails to identify certain types of 
segregation (e.g., mothe r -daug hter processes) and (ii) the sampling distribution of NNCT cell counts is not 
appropriate (see DixonI ( 19941 )). In a mother-daughter process, mothers are distributed randomly in the 



region of interest, while the daughters are randomly displaced within close vicinity of their mothers. In such 
a process, it is possible to obtain a NNCT in which the cell counts are very similar to the ones expected 
under the Pielou null hypothesis, while in reality the pattern exhibits the segregation of the daughters. For 
more detail on mother-daughter processes and e xamples for which Pielou's tes t giving misleading results, see 
(|Dixonl (|1994| )). Problem (u) was first noted bv iMeagher and Burdickl ([l983) who identify the main source 



of it to be reflexivity of (base, NN) pairs. As an alternative, they suggest using Monte Carlo simulations for 
Pielou's test. Dixon shows that Pielou's test is not ap propriate for c ompletely mapped data, but suggests 
that it might be appropriate for sparsely sampled data (Dixon ( 1994 )). 



In Pielou's test, each of the sampling frameworks requires that the cell counts arc independent. However, 
when a trial is label categorization of a (base, NN) pair, the assumption of independence between trials is 
violated due to reflexivity and shared NN structure. Thus Pielou's test measures deviations not only from 
the null pattern of RL or CSR independence but also from the independence of trials. This also suggests 
that Pielou's test would be liberal in rejecting the null hypothesis. The reflexivity and shared NN structure 
are not merely fini t e sam ple patterns, as they follow a certain non-degenerate distribution even when n ^ 00 
(IClark and EvansI ^95^). 



By construction, Pielou's test is used to test independence of the class labels of the (base, NN) pairs, 
but ignores the spatial information (hence ignores the spatial dependence, e.g., reflexivity of NNs). On the 
other hand, Dixon's tests are used for the null hypotheses of RL or CSR independence and uses more of 
the spatial information. For Dixon's tests, the underlying sampling framework for cell counts is different 
from Poisson, row-wise binomial, or overall multinomial sampling models of the contingency tables. In his 
framework, the probability of class j point serving as a NN of a class i point depends only on the class 
sizes (i.e., row sums), but not the total number of times class j serves as a NN (i.e., column sums). On 
the other hand, Piclou's test depends on both row and column sums. In fact, Pielou starts her arguments 



with NN probabilities depending on class sizes (row sums) in (jPieloul (jl961l ). pp 257-258). Then she leaves 
this traek of deve l opmen t because of dependence due to shared NN structure (i.e., the distribution of Qk 
( Clark and Evand (Il955l )). For testing RL or CSR independence, Dixon's framework is more appropriate as 
a sampling distribution for NNCT cell counts, as it accounts for the inherent spatial dependence between 
observations. 



4 Consistency of the NNCT-Tests 

The null hypotheses are different for Pielou's and Dixon's framework of testing spat ial pat t erns, and so 



are the alternative hypotheses. Hence the acceptance regions are different (see, e.g., iDixonI (jl994[ )). and 
no test is uniformly superior to the other, since both Ajj \ Ap and Ap\ Ao are non-empty, where Ao is 
the acceptance region for Dixon's test and Ap is the acceptance region for Pielou's test. That is, there are 
situations in which Pielou's test yields a significant result, while Dixon's test finds no significant segregation, 
and vice versa. For example, a pattern resulting from a mother/daugter process can fall in Ap \ Ad (see 
Section l3.5p . On the other hand, a process in which row and column sums in a NNCT are close but the 
cell counts are different than expected under Pielou null hypothesis, and similar to the ones expected under 
Dixon null hypothesis might yield a pattern that falls in \ Ap. Therefore the comparison of the tests 
(even for large samples) is inappropriate. But any reasonable test should have more power as the sample size 
increases. So, we prove that the tests under consideration are consistent, although they have appropriate 
size under different null hypotheses. The proofs of lemmas and theorems in this section are all deferred to 
the Appendix. 

In the following theorems we use the consistency of tests based on statistics that have A^(0, 1) or xt 
distributions asymptotically. First, we prove the consistency of Pielou's test of segregation and the one-sided 
versions. Let z{a) be the 100(1 — a)*'' percentile for the standard normal distribution and xti'^) be the 
100(1 — a)*'' percentile for distribution with ly df. 

Theorem 4.1. (I) Suppose the NNCT is constructed based on a random sample of (base, NN) pairs. The 
test for segregation : tth > 7r22 (spatial association : tth < n22) which rejects Ho : ttii = 7r22 for 
Zn > z{l — a) (for Zn < z{a)) with Zn given in Equation ^ has size a and is consistent. Likewise, the 
test against segregation : 'Ei[Nii/n] > vi ki which rejects Ho : E[Afii/n] = ki for Zn > z{l — a) with 
Zn given in Equation ^ has size a and is consistent. 

(H) Under RL or CSR independence, the size of the above one-sided tests are larger than a (i.e., the tests 
are liberal in rejecting Ho) but consistent (in the sense that the power goes to 1 as marginal sums tend to oo 
under the alternatives). 

Theorem 4.2. (L) Suppose the NNCT is based on a random sample of (base, NN) pairs. The test for 

Ha ■■ TTii 7^ TT21 which rcjccts Ho : tth = tt2i for > X?(l - a) with = X^Li I^j=i '^^''e^!^]'^^ 
consistent. 

(H) Under RL or CSR independence, the level of the test using Xp is larger than ex. (i.e., it is liberal in 
rejecting these null patterns) but is consistent in the sense of part (LL) of Theorem \4-.l\ 

Next, we prove the consistency of Dixon's cell-specific and overall tests of segregation. 
Theorem 4.3. Under RL, Dixon's cell-specific test for cell {i,j) in a NNCT denoted by Zjj ; i.e., the test 
rejecting Hq '■ Tr.y = "^["l"^) I(^ = j) + „"„-!) ^(^ j) C*-e., RL ) against the two-sided (and one-sided 
alternatives) for \Z^\ > z(l — a/2) (and Zj^ > z(l — a) or Zj^ < z(a)) with Z^ = ^[-'''^'jI gj^e a 

and is consistent. Under CSR independence, Zjj is consistent conditional on Q and R. 

Theorem 4.4. Under RL, Dixon's overall test of segregation; i.e., the test rejecting Ho '■ TTy — "nin'-^i)"^ -^(^ ~ 
■?') + „"«" 1) ^ -f^^ ^ r^-e-, RL) against the alternative Ha : tt^- ^ "n\'n-i) = )) + 

^ ]) for some i,j G {1, 2} for X^ > xl{l - «) with X^ = {N - E[N])'S]-(N - E[N]) is of size a 
and is consistent. Under CSR independence, X^ is consistent conditional on Q and R. 



5 Monte Carlo Simulation Analysis 



Piclou's test of independence and Dixon's overall test of segregation are not testing the same null pattern, 
so we can not compare the power of the tests under either segregation or association alternatives and we 
only implement Monte Carlo simulations to evaluate the finite sample performance of the tests in terms of 
empirical size. For the null case, we simulate the RL, CSR independence patterns, and independence of the 
rows in the NNCTs, with two classes labelled as X and Y with sizes ni and 712, respectively. 



5.1 Empirical Significance Levels of the NNCT-Tests under RL 

Under RL, we co nsider four ca ses. In RL Case (1) we use the locations of the trees in the swamp tree data 
(see Figure[2]and Dixon ( 1994 )) as the fixed points, and randomly assign ni = 182 points as X and 77.2 — 91 



points as Y points. In each of the other RL cases, we first determine the fixed locations of points for which 
class labels arc to be assigned randomly. Then we apply the RL procedure to these points for respective 
sample size combinations as follows. 

RL Case (2): First, we generate n = ni + n2 points iid U{{Q, 1) x (0, 1)), the uniform distribution on the 
unit square, for some combinations of ni,n2 £ {10,30,50,100}. In each (711,77.2) combination, the locations 
of these points are taken to be the fixed locations for which we assign the class labels randomly. For each 
sample size combination (77,1,772), we randomly choose Tii points (without replacement) and label them as 
X and the remaining 712 points as Y points, and repeat the RL procedure N^^c = 10000 times. At each 
Monte Carlo replication, we compute the NNCT-tcsts. Out of these 10000 samples the number of significant 
outcomes by each test is recorded. The nominal significance level used in all these tests is a = .05. The 
empirical sizes are calculated as the ratio of number of significant results to the number of Monte Carlo 
replications, Nmc- That is, for example empirical size for Dixon's overall test for (10, 10), denoted by ao, 
is calculated as an ■= J^i^" ^{-^d i — xK-O^)) where ^ is the value of Dixon's overall test statistic for 
iteration i, xK-O^) is the 95*'* percentile of X2 distribution, and I(-) is the indicator function. 

RL Case (3): We generate ni points iid U{{0, 2/3) x (0, 2/3)) and 772 points iid U{{l/3, 1) x (1/3, 1)) for 
some combinations of 771,772 G {10,30,50,100}. The locations of these points are taken to be the fixed 
locations for which we assign the class labels randomly. The RL procedure is applied to these fixed points 
Nmc = 10000 times for each sample size combination and the empirical sizes for the tests are calculated 
similarly as in RL Case (2). 

RL Case (4): We generate 77i points iid U{{Q,1) x (0,1)) and 772 points iid U{{2,3) x (0,1)) for some 
combinations of 711,712 G {10, 30, 50, 100}. The RL procedure is applied and the empirical sizes for the tests 
are calculated as in the previous RL Cases. 

The locations for which the RL procedure is applied in RL Cases (2)- (4) are plotted in Figure [T] for 
77i = 772 = 100. Although there are many possibilities for the allocation of points to which RL procedure can 
be applied, we only chose the locations of trees in a real life data set and three generic cases. Observe that 
in RL Case (2), the allocation of the points are a realization of a homogeneous Poisson process in the unit 
square; in RL Case (3) the points are a realization of two overlapping clusters; in RL Case (4) the points are 
a realization of two disjoint clusters. 

The empirical significance levels are presented in Table [U where Sf, is the empirical significance level 
for cell (1,7) with i G {1,2}, cxr and Sl are the estimated empirical significance levels for the right- and 
left-sided versions of Pielou's test, respectively (see Equation dtp and Spy are for Pielou's overall test 
of segregation without and with Yates' correction, respectively, Sd is for Dixon's overall test of segregation 
with 77i, 772 € {10, 30, 50, 100} and Nmc = 10000. Notice that among the cell-specific tests only a^^ and S|^2 
are presented in Table[31 since iVi2 = 77i — iVn and N21 = 772 — N22 which implies Sfi = af 2 ^21 ~ ^22 
in the two-class case. The empirical sizes significantly smaller (larger) than .05 are marked with (^), which 
indicate that the corresponding test is conservative (liberal). The asymptotic normal approximation to 
proportions is used in determining the significance of the deviations of the empirical size estimates from the 
nominal level of .05. For these proportion tests, we also use a ~ .05 to test against empirical size being equal 



to .05. With Nmc = 10000, empirical sizes less (greater) than .0464 (.0536) are deemed conservative (liberal) 
at a = .05 level. Observe that Dixon's cell-specific tests are slightly liberal or conservative or about the 
desired significance levels in rejecting Ho : RL when ni,n2 > 30. When rij < 10 for i = 1 or 2, then Dixon's 
cell-specific tests tend to be conservative if ni ^ n2 and liberal otherwise. Notice also that when rii < 10 for 
z = 1 or 2 and ni ^ n2 Dixon's cell-specific test is more conservative for cell (1, 1) which corresponds to the 
class with smaller size (i.e., class X) compared to class Y. On the other hand, Pielou's overall test and the 
right sided version of Pielou's test are extremely liberal for all sample size combinations; left sided version of 
Pielou's test is extremely liberal for all sample size combinations except for (10,50). Furthermore, Pielou's 
test with Yates' correction is liberal when min(ni,n2) > 30, conservative for (10,50) and liberal or about 
the nominal level for other sample size combinations. Notice also that. Spy values are significantly smaller 
(based on the tests of equality of the proportions for two populations) compared to Sp values. Dixon's 
overall test of segregation tends to be conservative for (10, 10) and (10, 30) and is about the desired nominal 
level for most of the other sample size combinations. These results suggest that under RL, Dixon's tests 
(especially the overall test) are appropriate, but Pielou's tests are not. 



5.2 Empirical Significance Levels of the NNCT-Tests under CSR Independence 

Under CSR independence, at each of Nmc = 10000 replicates, we generate points iid from U{{0, 1) x (0, 1)), 
for some combinations of ni, 712 G {10, 30, 50, 100}. Let X = {Xi, . . . , X^} be the set of class 1 points and 
Y = {Yi, . . . , Yn^} be the set of class 2 points. 

We present the empirical significance levels for Pielou's tests and Dixon's tests in Table [31 where Sfj''^'^ 
is Dixon's cell-specific test for cell and aD,qr is Dixon's overall test with Q and R are replaced with 
their (empirical) expected values and notations for the other tests are as in Section 15.11 The empirical 
sizes are calculated for some combinations of rti,7i2 G {10,30,50,100} and Nmc = 10000. Observe that 
Dixon's cell-specific tests tend to be slightly liberal or conservative or about the desired significance level 
for most sample size combinations. In particular, they have about the nominal level for ni = 712 > 30, 
and are extremely conservative for the smaller class when Ui = 10 for one of i = 1,2 (see, e.g, a^^ for 
(711,712) = (10,50) and af'2 for (711,712) — (50, 10)). The QR-adjusted versions of cell-specific tests tend to 
have different sizes than the uncorrected versions when rii < 30 for i = 1 or 2. For larger samples, QR- 
adjustment does not improve the sizes compared to the uncorrected ones. Pielou's overall test and one-sided 
versions arc extremely liberal for all sample sizes (however, notice that Pielou's overall and left-sided tests 
are least liberal for (721,712) = (10,50) and (711,712) = (50,10)). Pielou's test with Yates' correction is at 
the nominal level for (711,712) = (30,10), conservative for (711,712) = (10,50) and (711,712) = (50,10), and 
liberal for other sample size combinations. Notice also that apy values are significantly smaller than ap 
values (i.e., Yates' correction significantly reduces the empirical size for Pielou's overall test). As for Dixon's 
overall test, it tends to be conservative for small sample size combinations, and is about the desired level 
for most large sample size combinations. As in the cell-specific tests, QR-adjustm ent does not im prove on 
the uncorrected versions. For more detail on QR-adjustment for NNCT-tests, see (jCevhan jiooi)). Hence 
in the following sections, we only provide the uncorrected versions of Dixon's tests. 

Remark 5.1. Proportion of Agreement between Pielou's and Dixon's Overall Tests: At each 
sample size combination under the RL Cases (2)-(5) and CSR independence, we also record the number of 
times both Pielou's and Dixon's overall tests simultaneously yield significant results at a = .05. The ratio of 
number of significant results by both tests to the number of Monte Carlo replications, Nmc, is the proportion 
of agreement between the tests in rejecting the particular null pattern. That is, for example the proportion of 
agreement between Pielou's and Dixon's overall tests denoted by ap^o for (tii, 712) = (10, 10) under RL Case 
(2) is calculated as Sp£, := J^f^i ^{^h - X?(0-95))I(A'|, ,^ > xi(0.95)) where X^^^ is the value of Pielou's 
overall test statistic for iteration i. The estimates of the proportion of agreement values are presented in 
Table m Observe that ap^jj values are significantly smaller than min(ap,a£)) = ajj at each sample size 
combination under each null case. This supports the discussion in the first paragraph of Section [4l that is, 
the rejection regions (hence acceptance regions) for both tests are significantly different and neither one is 
uniformly superior to the other. □ 



1 n 



5.3 Empirical Significance Levels of the Tests under Independence of Rows and 
Cell Counts in NNCTs 



For the independence of rows and cell counts in the NNCTs, we consider two cases: overall multinomial 
and row-wise binomial frameworks. In the overall multinomial case, we generate all four cell counts using 
multinomial distribution, ^(n,7rii,7ri2,7r2i,7r22), with tth = 7r2i = 2(ni"+n2) ^^'^ ^ ^^22 = 2{ni+n2) ■ 
The NNCT constructed in such a way is (approximately) equivalent to one based on a random sample of 
(base, NN) pairs. In the row-wise binomial case, we generate the two cell counts in each row using binomial 
distribution, Nu ^ BIN(ni, ni/(ni -1-712)), and N21 BIN(n2, ni/(ni +n2)). The NNCT constructed in 
this way is also equivalent to one based on a random sample of (base, NN) pairs. 

For such NNCTs, we can only compute Pielou's test and the one-sided versions, but not Dixon's tests, 
since Dixon's tests require more information on the NN structure in the spatial distribution of the points, 
e.g., the quantities such as Q and R, which are not available in this case. That is, in any spatial allocation 
of points, the NN relations of each point is dependent on the the relations of neighboring points, which in 
turn implies that it is not realistic to have a random sample of (base, NN) pairs in practice. 

In Table [51 we present the empirical significance levels for all tests except Dixon's tests under the inde- 
pendence of cells and rows case. Observe that Pielou's test with Yates' correction is extremely conservative 
under both of the frameworks. On the other hand Pielou's one-sided tests and Pielou's test without Yates' 
correction have about the desired nominal level. 

Remark 5.2. Main Result of Monte Carlo Simulations for Empirical Sizes: Based on the simulation 
results under the CSR of the points, we recommend the disuse of Pielou's test in practice, as it is extremely 
liberal, hence it might give false alarms when the pattern is actually not significantly different from RL or 
CSR independence. Moreover Yates' correction docs not seem to fix the problems with Pielou's test, since 
the problems are not caused by the discrete nature of the cell counts. However, Yates' correction seems 
to improve the performance of Pielou's test, in the sense that empirical size of Pielou's test with Yates' 
correction gets closer to the nominal level compared to the uncorrected one. Even Dixon's tests fail to have 
the desired level when at least one sample size is small so that the cell count (s) in the corresponding NNCT 
have a high probability of being < 5. This usually corresponds to the case that at least one sample size is 
< 10 or the sample sizes (i.e., relative abundances) are very different in our simulation study. When sample 
sizes are small (hence the cor respon d ing N NCT cell counts are < 5), the asymptotic approximation of Dixon's 
tests is not appropriate. So iDixonI (|l994h recommends Monte Carlo randomization for his test when some 



NNCT cell count(s) are < 5 under RL. We concur with the same recommendation for the RL pattern and 
extend this recommendation for CSR independence. In fact, this recommendation is also partly consistent 
with the inapplicability of asymptotic results for contingency tables in general (not just for NNCTs) when 
cell counts arc too small. In general contingency tables, the chi-squared approximation seems to be valid 



in mo st cases if all expected cell counts are larger than 0.5 and at least half arc greater than 1.0 (jConover 



(199^). On the other hand, Cochran (1952) states that the approximation may be poor if any expected cell 



count is less than 1 or if more than 20 % of the expected cell counts are less than 5. □ 



6 Edge Correction for the CSR Independence Pattern 



In this section, we investigate the edge or boundary effects on the NNCT-tests used under CSR independence. 
Edge effects arise because CSR independence assumes an unbounded region, which is not the case in practical 
situations. Edge effects on s patial pattern ana l ysis a nd v ariou s correction methods are discussed extensively 
in spatial pattern literature (jClark and Evand (|l954 and ICressic (1991'). However, the effec tiveness of edge 
correction depends on the type of the statistic used (see, e.g., Yamada and Rogersenl ( 20031 )). For example, 
when the study region is rectangular, the edge effects can be minimized by including a buffer zone or area 



around the rectangle, or alternatively, the rectangular re gion is transformed into a tor us ( Dixon ( 2002bf )). 
In literature, buffer area is also referred to as guard area ([Yamada and Rogersen (|2003M The general idea 
is the same for buffer zone and toroidal edge corrections, but they are implemented in different ways. In 
buffer zone correction we assume the properties of the process are the same, even if we continue into the 
buffer area. In toroidal correction, the process is exactly the same outside of the study area. 



Without any edge correction the cell counts in a NNCT can be written as 

n n n 
k=l 1=1 k,l = l 

where Ski is 1 if point I is of class j and point k is of class i, and otherwise; Wki is 1 if point I is the NN of 
point k, and otherwise. The quantities Q and R can be written as 

n n 

^ WkiWmil{m ^ k ^ I) and R ^ ^ Wkiwikl{l k), 

m,k,l=l k,l = l 

with the understanding that l{in ^ k ^ I) = l{m ^ k)\{ni ^ l)l{k ^ I). 

6.1 Buffer Zone Correction for the CSR Independence Pattern 

In the buffer zone correction method, a guard area is selected inside or outside the study region and the 
points in the guard area are used only as destinations (not the base points) in NN relations. In the NN pair 
([/, y), point U is the base point, and point V is the destination point. When the buffer area is sufficiently 
large, the edge effects can be completely eliminated, but this is a wasteful procedure, because the large buffer 
area may contain many observations. 

Let Ro be the original study area, i?^ be the outer buffer area, and i?b be the inner buffer area and let n be 
the number of points that fall in Rq- In the outer buffer zone correction, let ub be the number of points that 
fall in Rq U Rb, and points with indices 1, 2, . . . , n lie in Rq, and points with indices (71 + 1), (71 + 2), . . . , 71^ 
lie in Rb- With the outer buffer zone correction, the NNCT cell counts are 

N,j=Y,J2'^kiSkiI{k^l). 

k=l 1=1 

Furthermore, the quantities Q and R are also modified as follows: 

riB n Ub tib 

Q = ^ ^ WkiWr,iil{m ^ki^l) and = ^ WkiWikI{k i^l)- ^ WkiWikI{k 7^ 0- 

m.,k=l 1=1 k,l=l k,l=(n+l) 

In the inner buffer zone correction, let ni, be the number of points in the inner buffer area Rh, and points 
with indices 1, 2, . . . , (n — 77,;,) lie in Rq \ Rb, and points with indices {n — + 1), {n — rif, + 2), . . . ,n lie in 
Rty. Then, with the inner buffer zone correction, the NNCT cell counts are 

n — rif, n 

N,, = Y,WkiSkil{k^l). 

k=l 1=1 

Furthermore, the quantities Q and R are 

n n — Tib n ub 

Q = Y Y WkiWmilim ^k^l) and R = ^ WkiWikI{k ^ I) - ^ WkiWikl{k ^ I). 

m,fc— 1 1=1 k,l=l k,l={n—nij + l) 

In our simulation study, we consider the edge correction by outer buffer zone correction only. For CSR 
independence, we generate points iid from U{{~.5, 1.5) x (—.5, 1.5)) until there are rii class X points and 
7^2 class Y points in the unit square for some combinations of 77.1,77.2 G {10,30,50,100}. We repeat this 
procedure Nmc = 10000 times for each 77.1,77.2 combination. The corresponding empirical significance levels 
are provided in Table [6] Observe that compared to the uncorrected sizes, with the (outer) buffer zone edge 
correction, the empirical sizes of the right-sided version of Pielou's test do not significantly change; empirical 
sizes of the left-sided version of Pielou's test significantly decrease for smaller samples, do not significantly 



change for other sample size combinations; empirical sizes of Pielou's test with Yates' correction do not 
significantly change (except for (10, 10)); empirical sizes of Dixon's cell-specific tests do not change for most 
sample size combinations. On the other hand, for Dixon's test, the empirical sizes significantly increase to 
become liberal for ni+n2 < 80. Furthermore Pielou's overall and one-sided tests still tend to be liberal with 
the (outer) buffer zone correction; Pielou's test with Yates' correction is liberal when ni +n2 > 40. Dixon's 
cell-specific tests become about the desired level when rii > 50 for both i = 1,2, the sizes do not change or 
improve in one direction for smaller samples. 

Remark 6.1. Inner versus Outer BufTer Zone Correction: The main difference between inner and outer 
buffer zone correction is the time of the selection of the buffer zone. In the outer buffer zone correction, a 
larger region than the intended region of interest is selected prior to recording the observations; while in the 
inner buffer zone correction some part of the original study region is designated as the buffer zone after the 
data is collected. Hence, theoretically the inner and outer buffer zones behave similarly. Indeed, Ro\Rb in the 
inner buffer zone correction acts like the original region Rq of the outer buffer zone correction, and likewise 
Rb in inner buffer zone correction acts like i?^ of the outer buffer zone correction. Thus, we only simulate 
the outer buffer zone correction, as it can also be equivalently viewed as the inner buffer zone correction. 
That is, the square (—.5, 1.5) x (—.5, 1.5) can be viewed as Rq and [(—.5, 1.5) x (—.5, 1.5)] \ [(0, 1) x (0, 1)] 
can be viewed as Rt- □ 



6.2 Toroidal Edge Correction for the CSR Independence Pattern 

In the toroidal edge correction, the original area is surrounded by eight copies of the original study area and the 
points in these additional copies are used only as destination points. For the toroidal edge correction, clusters 
around the boundaries might cause bias. Moreover, while toroidal correct ion applies only to rectangular study 
regions, buffer zone correction applies to any type of study region (see, ( Yamada and Rogersen ( 2003f )). 



Let Ro be the original study area, Rt be the eight copies appended to Rq so as to obtain NN structure for 
the points in Rq as if Rq is part of a torus. For the toroidal correction, let nx be the number of points that 
fall in Ro U Rt, and points with indices 1, 2, . . . , n lie in Rq, and points with indices (n + 1), (n -I- 2), ... , ht 
lie in the toroidal area Rt- With the toroidal correction, the NNCT cell counts are 

n riT 

Nu = J2J2'^kiSkiI{k^l). 

k=l 1=1 

Furthermore, the quantities Q and R are also modified as follows: 

Q = ^ ^^WkiWmilim k ^ I), and R = Wkiwiklik ^ I) - ^ Wkiwiki{k ^ I). 

m,k=l 1=1 k,l=l k,l=(n+l) 



For toroidal correction, under Ho : CSR independence, we generate ni X-points and n2 y-points iid from 
ZY((0, 1) X (0, 1)) for some combinations of ni, n2 G {10, 30, 50, 100}. We repeat this procedure Nmc — 10000 
times for each ni,n2 combination. The corresponding empirical significance levels are presented in Table [71 
Observe that toroidal edge correction does not significantly affect the empirical sizes of the NNCT-tests. 

Remark 6.2. Main Result of Edge Correction Analysis: The Monte Carlo analysis in Section[6]suggests 
that the empirical sizes of the NNCT-tcsts are not affected by the toroidal edge correction because in our 
Monte Carlo simulations, we have generated the CSR independence pattern on the unit square. Any clusters 
in a realization of CSR are due to chance and are equally likely to occur anywhere, so the clusters are more 
likely to occur away from the boundary of the region. However, the (outer) buffer zone edge correction 
method seems to have stronger influence on the tests compared to toroidal correction. In particular, the 
empirical sizes of the Dixon's test tend to significantly increase with buffer zone correction. But for all other 
tests, buffer zone correction does not change the sizes significantly for most sample size combinations. 



This is in agreement with the findings of Barot et al. ( 19991 ) which says NN methods only require a small 



buffer area around the study region. A large buffer area does not help too much since one only needs to 
be able to see far enough away from an event to find its NN. Once the buffer area extends past the likely 



NN distances (i.e., about the average NN distances), it is not adding much helpful information for NNCTs. 
Furthermore, since buffer (inner or outer) zone correction methods arc wasteful, and strongly depend on the 
size of the zone, we do not recommend their use for NNCT-tests. On the other hand, one can use toroidal 
edge correction, but the gain might not be worth the effort. □ 



7 Examples 



We illustrate the tests on two example data sets: Dixon's swamp tree data ( Dixon ( 2002bl )) and an artificial 



data set. Wc present the corresponding NNCTs, test statistics, and edge correction results for both examples. 
Since an outer buffer zone is not provided in these examples, inner buffer zone correction is the only type of 
buffer zone correction we can apply. Moreover, since the regions are rectangular, we can also apply toroidal 
edge correction in these examples. 



7.1 Swamp Tree Data 



DixonI (|2002bl ) illustrates NN-methods on tree species in a 50 m by 200 m rectangular plot of hardwood 
swamp in South Carolina, USA. The plot contains trees from 13 different tree species, of which we only 
consider the live trees from two species, namely, black gums and bald cypresses. If spatial interaction among 
the less frequent species wer e impor t ant, a more detailed 12 x 12 NNCT-analysis should be performed. For 
more detail on the data, see (Dixon (l2002bl) '). The locations of these trees in the study region are plotted in 
Figure [H 



DixonI l|2002al ) applies his methodology for this data set assuming the null pattern is the RL of tree species 



to the given locations. But it is more reasonable to assume that the locations of the tree species a priori 
result from different processes. Hence the more appropriate null hypothesis would be the CSR independence 
pattern, which implies that NNCT-test results are conditional ones. The question of interest is whether 
the two tree species are segregated, associated, or do not deviate from the CSR independence pattern. The 
corresponding NNCT and the percentages are provided in Table El The percentages for the cells are based 
on the sample size of each species. That is, for example 82 % of black gums have NNs from black gums, 
and remaining NNs of black gums are from bald cypresses. The row and column percentages are marginal 
percentages with respect to the total sample size. The percentage values are also suggestive of segregation, 
especially for black gum trees. 

For the raw data (i.e., data not corrected for edge effects), we find Q = 178 and R = 156. The test 
statistics are provided in Tabled where stands for Dixon's overall segregation test, Xp and Xpy for 
Pielou's test without and with Yates' correction, respectively, Z„ for the directional Z-test. The p-values are 
for the general alternative of deviation from CSR independence for X^,, Xp, and Xpy', and for Z„, the first 
p- value in the parenthesis is for the association alternative, while the second is for the segregation alternative. 
Observe that all two-sided tests are significant, implying significant deviation from CSR independence. The 
directional (one-sided) tests indicate that black gum trees and bald cypresses are significantly segregated. 

Table |9] also contains the p-values when the edge correction methods are applied. The toroidal cor- 
rection does slightly change the test statistics, but not the conclusions. For the inner buffer zone cor- 
rection, we first estimate the density of the combined species, namely A = (rii + n2)/Ar where Ar is 
the area of the study region. Let W be the distance from a randomly chosen event to its NN, then 

under CSR independence, Ei[W] = 1^ (2\^ and Var[VK] = (4 — 7r)/(4 7rA), where A is the intensity 

of the point process (|Dixonl ll2002d) ). So, we move the boundaries inside the rectangular study area by 



E,[W] + k y^Ya.r[W] = 1^(2 + fc y (4 — 7r)y^ ^4 tt A^ where k determines how much of the study region 

is regarded as the buffer zone. We implement the inner buffer zone correction with fc = 0, 1,2,3, but only 
present the results for fc = 0, 1, since each case yields similar test statistics and the same conclusions. 

Based on the NNCT-tests, we conclude that tree species exhibit significant deviation from the CSR 



t r' 



independence pattern. Considering Figure [H and the corresponding NNCT in Table [51 this deviation is 
toward the segregation of the tree species. However, the results of NNCT-tests pertain to small scale 
interaction, i.e., at about the average NN distances. To understand (possible) causes of the segregation and 
the type and level of interaction between the tree species at different scales (i.e., distances between the trees) 
we also provide Ripley's univariate and bivariate L-functions in Figure [31 where the spatial interaction is 
analyzed for distances up to 50 meters. 

In Figure[3l wc present the plots of Ripley's univariate L-function Lii{t) — t for both species, and bivariate 
L-fimction Li2{t) for the pair of tree species. Due to the symmetry of Lij{t), we omit L2i{t). We also present 
the upper and lower 95 % confidence bounds for each Lait) — t and Li2{t) — t under CSR independence. 
Observe that black gums exhibit significant aggregation for distances t > 1 m (the Lii(i) — t curve is above 
the upper confidence bound); bald cypresses exhibit no deviation from CSR around t « 5 m, then they 
exhibit significant spatial aggregation for t up to 30 m, then for larger distances {t > 45 m) they exhibit 
spatial regularity. Observe also that black gums and bald cypresses are significantly segregated for distances 
up to t w 42 m {Li2{t) — t is below the lower confidence bound), for larger distances their interaction does not 
deviate significantly from CSR independence. The segregation of the species might be due to different levels 
and types of aggregation of the species in the study region. Note also that average NN distance for swamp 
tree data is 3.08 ± 1.70 (mean ± standard deviation) and results of bivariate i-function and NNCT-analysis 
agree for distances around i = 3 m. 



7.2 Artificial Data Set 



In the swamp tree example, although the expected NNCT cell counts (not presented) are different for Pielou's 
and Dixon's tests and p- values for Dixon's tests are larger than others, we have the same conclusion: there is 
strong evidence for segregation of tree species. Below, wc present an artificial example, a random sample of 
size 100 (with 50 X-points and 50 y-points uniformly generated on the unit square). The question of interest 
is the spatial interaction between X and Y classes. We plot the locations of the points in Figure [4] and the 
corresponding NNCT together with percentages are provided in Tabic \8\ Observe that the percentages are 
suggestive of mild segregation, with equal degree for both classes. 

The data is generated to resemble the CSR independence pattern, so we assume the null pattern is CSR 
independence, which implies that our inference will be a conditional one. Observe that in Table [HI Pielou's 
tests are significant while Dixon's test are not, which might be interpreted as evidence of deviation from CSR 
independence. The graph in Figurc[4]is not suggestive of any such deviation from CSR independence, and the 
dependence between NNCT cell counts might be confounding the conclusions based on Pielou's tests. With 
toroidal correction, all p- values increase, but only for Pielou's test with Yates' correction becomes insignificant 
after the correction. With buffer zone correction with k = 0, all p-values get to be insignificant, except for 
the one-sided test for segregation. Furthermore, with fc = 1, the changes in p- values are more dramatic. We 
also have similar changes with fc = 2, 3 (not presented). Thus, inner buffer zone edge correction, might make 
a big difference if the pattern is a close call between CSR independence and segregation/association. That is, 
if a segregation test has a p- value about .05, when a subregion is reserved as the inner buffer zone, either we 
might have a pattern different from CSR independence in the area for the base points (i.e., Rq \ Rb) or after 
the loss of data in the buffer zone the power of the tests might decrease. We also point out that, inner and 
outer buffer zone corr ection methods are not r ecom mended in literature for spatial pattern analysis with, e.g., 
Ripley's A'-function (jYamada and Rogerse 3 (|2003h V and we concur with this suggestion for NNCT-tests. 



Since Pielou's and Dixon's tests give different results in terms of significance, we also provide Ripley's 
univariate and bivariate i-functions in Figure [51 where the spatial interaction is analyzed for distances up 
to t = 0.25. Observe that X points exhibit significant regularity for distances .18 < t < .24 and no deviation 
from CSR for other distance values. Y points do not deviate significantly from CSR for all distances 
considered, although they are close to being regular. Observe also that X and Y points are significantly 
associated for distances around t ~ 0.06 and t « 0.10, for all other distances their interaction does not 
deviate significantly from CSR independence. Hence we conclude that the significant segregation implied 
by Pielou's test seems to be a false alarm, since in fact, the spatial interaction between the points is not 
significantly different from CSR independence. Note also that average NN distance for the artificial data is 



-1 r-r 



.052 ± .03 and results of the bivariatc L-function and NNCT-analysis agree around t = .05. 



8 Discussion and Conclusions 



In this article we discuss segregation tests based on nearest neighbor contingency tables (NNCTs). These 
NNCT-tests include Pielou's test (with or without Yates' correction), Dixon's cell-specific and overall tests, 
and the newly introduced one-sided versions of Pielou's tests. A summary of the test statistics together with 
the underlying assumptions, the appropriate null hypotheses, and the quantities the statistics are conditional 
on are presented in Table fTOl 

Pielou's a nd Dixon's t ests were both d efine d under the nu ll hypothesis of random labeling (RL) of a fixed 
set of points (jPieloul (|l961lDixonl (|l994[) andlDijconI (|2002a[) ). It has been shown that Pielou's test is not 



appropriate for testing RL, but Dixon's tests are. The main problem with Pielou's NNCT-tests (including 
the one-sided versions) discussed in this article is the dependence between trials (i.e., categorization of (base, 
NN) pairs) and between the NNCT cell counts. 

In this article, we extend the use of the NNCT-tests for the CSR of points from two classes (i.e., CSR 
independence). We demonstrate that Pielou's tests are liberal for rejecting RL or CSR independence, while 
Dixon's tests are appropriate for large samples. For smaller samples (i.e., when some cell count in the NNCT 
is < 5) we recommend the Monte Carlo randomization version for NNCT-tests. We also show that Pielou's 
tests are only appropriate for a NNCT based on a random sample of (base, nearest neighbor (NN)) pairs 
(which is not a realistic situation in practice). We prove the consistency of the tests under appropriate 
null hypotheses; evaluate the empirical size performance of these tests based on an extensive Monte Carlo 
simulation study under RL, CSR independence, and independence of cell counts and rows in Section [5l 
Based on the Monte Carlo analysis, for moderate to large sample sizes Dixon's tests are recommended. 

Under CSR independence, edge or boundary effects might be a potential concern for the NNCT-tests. 
Based on Monte Carlo analysis for edge correction methods, with buffer zone correction, we find that the 
uncorrected and corrected results are not significantly different. In particular, buffer zone correction methods 
(with larger distances than the average NN distances) are not recommended, as they are wasteful procedures 
and do not serve the purpose of correcting for the edge effects. The outer buffer zone correction with large 
buffer areas is redundant hence not worth the effort, while inner buffer zone correction with large buffer areas 
is wasteful and might cause bias and loss of power in the analysis. With toroidal correction only significant 
change occurs for Dixon's overall change, but corrected versions are usually liberal. Hence edge effects are 
not significantly confounding the results of the NNCT-tests (under CSR independence). 

In this article, NNCTs are based on NN relations using the usual Euclidean distance. But, this method 
can be extended to the case that NN relation is based on dissimilarity measures between observations in 
finite or infinite dimensional space. It is even possible to have situations that arc completely non-spatial 
and yet one can conduct NNCT analysis based on dissimilarity measures. In this general context the NN 
of object X refers to the object with the minimum dissimilarity to x. The extension of RL pattern is 
straightforward, but extra care should be taken for such an extension of CSR independence. For example, in 
either case, the term Q (in Dixon's tests) which is the number of points with shared NNs needs to be revised 
as Q := 2 X]fe=i (2)^'=- conjecture that these tests when applied to other fields for high dimensional 
data and NN relations based on dissimilarity measures, can be useful. 



Appendix 

Proof of Theorem 14.11 



(I) Suppose under Ho, NNCT is based on a random sample of (base, NN) pairs. For the two-class case, we parametrize 
the segregation alternative as Ha ■ vfii = ^21 + e for e £ (0, 1 — 7721). Then the hypotheses become Ho : e = and 



1 rt 



Ha : £ > 0. The rejection criterion in the theorem is equivalent to Z„ > z{l ~ a) where Z„ is defined as in Equation 
©. Then B[Z„\Ho] = and E = e > 0. 

In the row-wise binomial framework with A'^n ~ BIN(ni , tth) and N21 ~ BIN(n2, 7ri2) being independent, consider 
T„ = (iVii/ni - N2i/n2). Under Ho, E[Tn\Ho] = and under Ha, the expected value of T„ becomes E [T„\Ha] = 
e > 0. Under both null and alternative hypotheses the test statistic {T„ — E[T„])/^Var[r„] has normal distribution 
asymptotically. By an appropriate application of Slutsky's Theorem, Z„ — {Nn/ni — N2i/n2) ^ ^c^c^ ' ^-nd {Tn — 

E[r„])/-\/Var [r„] have the same asymptotic distribution. Hence the size of the test is a. Furthermore, consistency 
follows by the asymptotic normality. The consistency for spatial association for the two-class case follows similarly. 

In the multinomial framework with N = {Nn, N12, N21, N22) ~ ./#(n, i^iki, J'iK2, J'2Ki, t'2S:2), we parametrize the 
segregation alternative as Ha : E [Nn/n] = ui ki +e. Consider the test statistic T„ := N\i /n — i/\ Hi. Then expected 
value of Tn under Ho is E[r„|_ffo] = and (r„ — E[r„])/-\/Var[r„] is approximately normal for large n under both null 
and alternative hypotheses. By an appropriate application of Slutsky's Theorem and some algebraic manipulations, 
one can see that (r„ - E[r„])/VVar[r„] and Zn given in Equation (jSj are asymptotically equivalent and converge 
in law to the standard normal distribution under Ho and the same normal distribution under Ha- So, the test has 
size Q and using r„, consistency of the test for fff follows. The consistency for spatial association for the two-class 
case follows similarly. 

(II) Under RL or CSR independence, E[T„1 = — 3. VlL- — 3_ which converges to zero as n — > cx). Let 

n— 1 n — 1 n — 1 

Var/j[T„] be the variance of T„ under RL or CSR independence and Var/[r„] be the variance of T„ under the case 
that (base, NN) pairs are independent. Then our claim (which is proved below) that 

Vam[r„] > Var/[T„] (9) 

for large n. Hence, T„ rejects RL or CSR independence in favor of segregation or spatial association more frequently 
than it should; i.e., it is liberal in rejecting these null patterns, or equivalently, its nominal significance level is larger 
than the prespecified level a. However, under the above parametrization of segregation, we have E [r„|_fff ] = e > 
and Var//[T„] converges to zero as n ^ 00. Hence, consistency for segregation follows. Consistency for the spatial 
association alternative follows similarly. ■ 

Proof of the Claim in Equation 

For large n, the probabilities in Equation (O take the form 

P^i = Pij = Vj, Piii = i^f , Piij = I'f Vj, Piijj = i^f v'^j , Piiii = vt for i,j G {1, 2}. 

Then variance of Tn under RL or CSR independence is 

Var//[r„] = Var[Afn/ni] + Var[iV2i/n2] - 2 Cov[Afii/ni, iV2i/n2]. 
Using Equation ((6)1 in Section TS . 3 . II and Equation (6) in (|Dixonl (|2002al )). we have, 

Var[7Vii/ni] = — \{n + R) vl + {2n - 2R + Q)vl + {n^ ~ in ^ Q + R)vt ~ [nvlf] 



and 



1 



\n + R + {2n - 2 R + Q) 1^1 + [R - 3n - Q) i^t] 



Var [Ai'21 / n2] = ^2.3 '^n^2 + Q fi t^i + {n^ - ^71 — Q + R) vl v\ - (n v\ 1^2)^^ 

\nv\+Qv\V2^{R — 2>n — Q)vi V2] 



and 



CovWii/ni, A''2i/n2] = ^5 \(n ~ R — Q) Vi 1/2 + (n — 3n — Q + R) Vi V2 ~ n Vi 1/2] 

V1V2 



^ \ \{n- R-Q)vi + {R-in-Q) vf] 



Hence by algebraic manipulations, we get 



Var/j r„ = \ 



Furthermore, under RL or CSR independence, Ki = i/i for i = 1,2. Then for large n, variance of r„ under RL or 
CSR independence is 

-tr 1 7rii(l-7rii) 7r2i(l-7r2i) - Ki) U2 Ki{1 - 1/2 Ki) 

Varj J„ = \ = 1 

ni n2 nui n V2 

_ uf{l - vl) ^ V2 V\{1 - V2 V\) _ - v'i) _|_ - V2 V\) _ V\ (2 - V\) 

nv\ nv2 nv\ nv2 n 

Then we need to show that, ^ _|_ ^ = ]^ _|_ :5 ^ > (2 — i^i) which trivially follows, since 1 > ui (2 ~ i^i) 

n U2 n U2 

which follows from — 1)^ > Q. ■ 



Proof of Theorem 14.21 

(I) Suppose under Ho, NNCT is based on a random sample of (base, NN) pairs. In the two-class case, deviations 
from Ho are as in Theorem 14.11 With such a deviation from Ho; i.e., under Ha, for large n, Xp is approximately 
distributed as a distribution with non-centrality parameter \{e) and 1 degrees of freedom (d.f.), which is denoted 

y^{\{e)). The non-centrality paramet er is a q uadra tic form which can be written as /^(e)' A^l{e) for some positive 
definite 2x2 matrix A of rank 1 (see iMoseij (|l996h ) hence A(e) > under Ha- Then as n — > oo, the null and 
alternative hypotheses are equivalent to Ho ; A = versus Ha ■ A — A(e) > 0. Then the size of the test is a and the 
consistency follows. 

(II) Under RL or CSR independence, the dependence in the row sums or column sums, which causes the reduction 
in d.f. is not the only type of dependence present in the NNCT. In addition to this, the NNCT cell counts are also 
dependent due to reflexivity and shared NN structure. Hence, even under RL or CSR independence, the corresponding 
distribution is still a scaled version of central distribution, but has a larger variance than Xi distribution, hence 
the nominal level of the test is larger than the prespecified a. On the other hand, as n — > oo, the power of the test 
goes to 1. ■ 



Proof of Theorem 14.31 

Consider the one-sided alternative with Ha : Hij > i^ij. Let r„ = Nij jn — Vij. Then (r„ - E[r„])/VVar[T„] = Z^,. 
Under RL, E[T'„] = and Var[r„] is given in Equation ((Gjl. Consider the parametrization of the alternative as 
Ha •■ TTij — Vij + e for eg (0, 1 — va). Then under Ha, E[Tii|_ffa] = e > 0. As Tii ^ oo, Na have asymptotic 
normal distribution (|Cuzick and Edwards! (|l990t )). This implies A^i2 is also asymptotically normal, as ni — » cxd, since 
N\2 = ni — A''ii. Similarly, A^2i has asymptotically normal distribution as n2 — > oo. Thus under both null and 
alternative hypothesis, (T„ — E[r„])/ -^/Var [r„] has asymptotic normal distribution. Then the size of the test is oi 
and consistency follows. The consistency for the other types of alternatives follow similarly. ■ 



Proof of Theorem 14.41 



Under RL, Y = N — E[N] is approximately distributed as A''(0, E) for large n. Let E~ be the generalized inverse of E 
whose rank is 2. Then by Theorem 3.1.2 of iMoseJ ()l996l ). under Ho, Y'E^Y ~ X2(-^ ~ 0). Hence the test has size a. 
Consider any deviation from Ho- Then under Ha, E[Y|_ffa] = fJ-a ^'id Y have multivariate normal distribution with 
mean /x^. Then by Theorem 3.1.2 of lMoseJ (|l996l ). under Ha, Y'E"Y ~ xU^ 

= Ai^jE ^a)- Since E is positive 

definite and /x^ is nonzero, the mean of the quadratic form X^, is A + 2 with A > 0. So for large A'', the null and 
alternative hypotheses are equivalent to Ho : A = and Ha : A > 0. Then consistency follows. ■ 
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Table 1: The NNCT for two classes. 
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Figure 1: The fixed locations of points for which RL procedure is apphed for RL Cases (2)-(4) with n = 200 
(for the case with ni = 712 = 100) in the two-class case. Notice that x-axis for RL Case (4) is differently 
scaled than others. 
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Figure 2: The scatter plots of the locations of black gum trees (solid circles) and bald cypress trees (triangles). 
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Figure 3: Second-order properties of swamp tree data. Functions plotted are Ripley's univariate L- functions 
Lii{t) — t for i = 1,2, and bivariate L-function Li2(i) — t where i = 1 for black gums and i = 2 for bald 
cypresses. The thick dashed lines around are the upper and lower 95 % confidence bounds for the L- 
functions based on Monte Carlo simulation under the CSR independence pattern. Notice also that vertical 
axes are differently scaled. 
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Table 2: The empirical significance levels for the under RL cases (l)-(4) at a = .05. Here af^ is the 
empirical significance level for Dixon's cell-specific test for cell for i e {1,2}, ap is for the right-sided 
version of Pielou's test, is for left-sided version of Pielou's test, ap and apy are for Pielou's overall test 
of segregation without and with Yates' correction, respectively, Sd is for Dixon's overall test of segregation. 
= :The empirical size is significantly smaller than .05; i.e., the test is conservative. ^:The empirical size is 
significantly larger than .05; i.e., the test is liberal. 
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Table 3: The empirical significance levels for the tests under Ho : CSR independence with N„ic = 10000, 
ni,n2 € {10,30,50,100} at a = .05 for uniform class X and Y points in the unit square. Here af.^'^^ is 
the empirical significance level for the QR-adjusted cell-specific test for for i G {1,2}, and ao^qr is 
for QR-adjusted Dixon's overall test of segregation. Other empirical size estimate notation and superscript 
labeling are as in Table [2l 



Proportion of agreement between Pielou's and Dixon's overall tests 




(10,10) 


(10,30) 


(10,50) 


(30,30) 


(30,50) 


(50,50) 


(50, 100) 


(100,100) 


RL Case (2) 


.0255 


.0245 


.0344 


.0317 


.0275 


.0321 


.0302 


.0336 


RL Case (3) 


.0226 


.0189 


.0321 


.0323 


.0275 


.0331 


.0289 


.0320 


RL Case (4) 


.0208 


.0192 


.0335 


.0329 


.0250 


.0292 


.0298 


.0355 


CSR independence 


.0277 


.0242 


.0340 


.0296 


.0270 


.0321 


.0342 


.0314 



Table 4: The proportion of agreement between Pielou's and Dixon's overall segregation tests apjj for 
rejecting the RL Cases (2)-4 and CSR independence with N^c — 10000, ni,n2 S {10, 30, 50, 100} at a — .05. 
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Table 5: The empirical significance levels for the tests under independence of cells and rows with N^c = 
10000, ni,n2 G {10,30,50,100} at a = .05 for contingency tables based on the overall multinomial and 
row- wise binomial frameworks. The empirical size notation and superscript labeling are as in Table [H 
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cell-specific 


one-sided 


overall 


"-2) 






a.R 




ap 


apY 


ao 


(10,10) 


.0464- 


.0442^'- 


.0811*'- 


.1402*'< 


.1169*'< 


.0506< 


.0604^'> 


(10,30) 


.0317'='- 


.0602*'> 


.0830*'- 


.1296*'< 


.1283*'< 


.0530- 


.0572*'> 


(10,50) 


.0258'='- 


.0526> 


.0946*'- 


.0502< 


.0625*'- 


.0294'='- 


.0569*''> 


(30,10) 


.0565*-> 


.0289^'- 


.0799 *'- 


.1273*'< 


.1235*'< 


.0485- 


.0540'^'> 


(30, 30) 


.0518- 


.0522- 


.0782*'- 


.1104*'- 


.1284*'- 


.0848*'- 


.0545'''> 


(30, 50) 


.0608*'- 


.0554*'- 


.0886*'- 


.1088*'< 


.1253*'- 


.0769*'- 


.0521> 


(50,10) 


.0537*'- 


.0269'='- 


.0959*'- 


.0553*'- 


.0669*'- 


.0297*'- 


.0599*'> 


(50,30) 


.0573*'- 


.0640*'- 


.0870*'- 


.1122*'- 


.1257*'- 


.0822*'- 


.0540*''> 


(50,50) 


.0463'='- 


.0471- 


.0820*'- 


.0998*'- 


.1330*'- 


.0946*'- 


.0510- 


(50, 100) 


.0513< 


.0519- 


.0939*'- 


.1040*'- 


.1209*'- 


.0905*'- 


.0544*'* 


(100,50) 


.0527- 


.0497< 


.0906*'- 


.1038*'- 


.1211*'- 


.0903*'- 


.0481- 


(100, 100) 


.0473- 


.0475- 


.0922*'- 


.1069*'- 


.1314*'- 


.1039*'- 


.0493- 



Table 6: The empirical significance levels for the overall tests under Ho with Nmc ~ 10000, n\,n2 S 
{10,20,30,40,50} at a = .05 for uniform class X and Y points in the unit square when edge correction 
with buffer zone is applied. The empirical size notation is as in Tabled) '=:The empirical size is significantly 
smaller than .05; i.e., the test is conservative. *:The empirical size is significantly larger than .05; i.e., the 
test is liberal. ^(^):Empirical size significantly smaller (larger) than the uncorrected size. ~: Empirical size 
not significantly different from the uncorrected size. 



Empirical significance levels for the NNCT-tests with toroidal edge correction 
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Table 7: The empirical significance levels for the overall tests under with N^c = 10000, ni,n2 G 
{10,20,30,40,50} at a = .05 for uniform class X and Y points in the unit square when toroidal edge 
correction is applied. The empirical size notation and superscript labeling are as in Table [SI 
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Table 8: The NNCT for swamp tree data (top left) and the artificial data (bottom left) and the corresponding 
percentages (riglit). B.G. = black gums, B.C. = bald cypresses. 
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Table 9: The values of the NNCT-test statistics and the corresponding p- values (in parenthesis) for the 
swamp tree data (top) and artificial data set (bottom). X^: Dixon's overall segregation test, X^ and X^y-. 
Pielou's test without and with Yates' correction, respectively, Z„: directional Z-tcst. The p-values are for 
the general alternative of any deviation from GSR independence except for Z„, for which the first p- value in 
the parenthesis is for the association alternative, while the second is for the segregation alternative. 
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A Summary of the NNCT-Tests 
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Table 10: The summary of underlying assumptions, appropriate null hypotheses Ho, and the type of conditioning for the tests based on NNCTs. Xp stands 
for Piclou's overall test, Zn for one-sided version of Pielou's test, X^) for Dixon's overall test, and Z[j for Dixon's ccU-spccific test. Pielou's overall test with 
Yates' correction has the same properties as the uncorrected one (hence not presented). NA stands for "not applicable". 
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Figure 4: The scatter plots of the locations of X points (solid circles) and Y points (triangles). 




Figure 5: Second-order properties of the artificial data. Functions plotted are Ripley's univariate L-functions 
Lii{t) — t for i = 1,2, and bivariate L-functions Li2{t) — t where i = 1 for X and i = 2 ioi Y points. The 
thick dashed lines around are the upper and lower 95 % confidence bounds for the L-functions based on 
Monte Carlo simulation under the CSR independence pattern. Notice also that vertical axes are differently 
scaled. 



